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ABSTRACT 

We investigate the large-scale distribution of galaxy clusters taken from sev- 
eral X-ray catalogs. Different statistics of clustering like the conditional cor- 
relation function (CCF) and the minimal spanning tree (MST) as well as void 
statistics were used. Clusters show two distinct regimes of clustering: 1) on scales 
of superclusters (~ 40/i~^ Mpc) the CCF is represented by a power law; 2) on 
larger scales a gradual transition to homogeneity (~ 100/i~^ Mpc) is observed. 
We also present the correlation analysis of the galaxy distribution taken from 
DR6 SDSS main galaxy database. In case of galaxies the limiting scales of the 
different clustering regimes are 1)10-15 Mpc; 2) 40 — 50/i~^ Mpc. The differ- 
ences in the characteristic scales and scaling exponents of the cluster and galaxy 
distribution can be naturally explained within the theory of biased structure for- 
mation. We compared the density contrasts of inhomogeneities in the cluster 
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and galaxy distributions in the SDSS region. The value of the density contrast 
should be taken into account to reconcile the observed gradual transition to ho- 
mogeneity with the apparent presence of structures on the corresponding scales. 
The estimation of the relative cluster-galaxy bias (comparing number of clusters 
in different SDSS regions with corresponding number of galaxies) gives the value 
b = 5 ± 2. The distribution of real clusters is compared to that of simulated 
(model) clusters (the MareNostrum Universe simulations) . We selected a cluster 
sample from 500 Mpc simulation box with WMAP3 cosmological parameters 
and (78 = 0.8. We found a general agreement between the distribution of ob- 
served and simulated clusters. The differences are mainly due to the presences of 
the Shapley supcrclustcr in the observed sample. On the basis of SDSS galaxy 
sample we study properties of the power law behavior showed by the CCF on 
small scales. Wc show that this phenomenon is quite complex, with significant 
scatter in scaling properties, and characterized by a non-trivial dependence on 
galaxy properties and environment. 

Keywords: galaxies, galaxy clusters, large-scale structure of the Universe, dark matter 
simulations 



1. Introduction 

Clusters of galaxies are the most massive virialised structures in the Universe. Within 
the framework of the modern "picture of the world" with dark matter and dark energy 
(cosmological constant) the structures evolve via gravitational instabihty starting from small 

seeds which are described by the spectrum of initial inhomogeneities. Clusters of galaxies are 
perfect probes of the matter distribution on large scales. Studying their spatial distribution 
one can constrain the parameters of the ACDM model {Qm, (^s)- Baryonic gas falls into 
cluster potential wells and heats up to temperatures of the order of 10^ K so that it emits 
X-rays. Clusters of galaxies were historically identified first as overdensities in the galaxy 
distribution (AGO [1] and APM [2] optical galaxy cluster catalogs). This kind of selection 
leads to some spurious objects due to projection effects. When clusters are detected according 
to their X-ray flux we undoubtedly deal with deep potential wells. In this work we investigate 
the distribution of galaxy clusters selected by X-ray flux from several catalogs of X-ray 
clusters and compare it with simulated clusters taken from the " MareNostrum Universe" [3] . 
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Fig. 1. — Distribution in equatorial coordinates of 400 X-ray clusters with redshifts 2; < 0.1 
and Lx{0.1 — 2AkeV) > 1.25 ■ 10^^/i~^ erg/s. The edges of minimal spanning tree shorter 
than 45 Mpc are shown by solid hues. Circles of constant galactic latitude {b — —20°, 
0°, +20°) are plotted by dotted lines. Gray circles represent the spectral plates of the SDSS- 
DR6. 

2. X-ray galeixy cluster sample 

The X-ray cluster sample consists of the all sky ROSAT clusters with X-ray flux Fx > 
3 • lO"-*^^ erg/cm^ /s in the (0.1-2.4 keV) energy band. Clusters were selected from the 
following catalogs: 1) REFLEX (N=186 clusters) [4]; eBCS (N=108) [5,6]; NORAS (N = 
36) [7]; CIZA (N = 70, galactic latitude \bgai\ < 20°) [8,9]. The final all-sky sample consists 
of 400 X-ray clusters up to rcdshift z = 0.1 with himinosities Lx > 1.25 ■ /i^^lO^'^ crg/s 
(assuming the current rate of universal expansion - Hubble constant Hq = 100 km/s/Mpc, 
h=H//Jo) where H is the true value of Hubble constant). The volume-limited sample (VL) 
extracted from this compilation contains 233 X-clusters with redshifts limited by Zyi = 0.09 
(which corresponds to the radial distance of 265.3 Mpc {flm = 0.24, fl/^ = 0.76) with 
Lx > 2.5-/i-2l0^^erg/s. 
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Fig. 2. — Distribution of X-ray clusters with z < 0.1 and Lx(0.1 — 2AkeV) > 1.25 ■ 
lO'^'^'h^'^ erg/ s around the Northern Galactic Pole. The edges of minimal spanning tree shorter 
than 45h~^ Mpc are shown by solid lines. Circles of constant galactic latitude (6 = 0°, +20°) 
are plotted by dotted lines. Gray circles represent the spectral plates of the SDSS-DR6. 

3. Model cluster sample 

Model clusters The MareNostrum clusters (MN-clusters) were extracted from the 500 
h^^ Mpc simulation box MUWHS [10] with cosmological parameters = 0.24, Q\ = 0.76, 
h=0.73, (Tg = 0.8 (cg, the present-day rms mass fluctuations on spheres of radius 8h^^ Mpc is 
slightly higher than predicted by WMAP3 and in better agreement with WMAP5). Within a 
box of 500h^^ Mpc size the linear power spectrum at redshift z = 40 has been represented by 
512^ DM particles of mass ttiom = 8.3 • 10^h~^Msun (In the following we assume h — 1). The 
nonlinear evolution of structures has been followed by the GADGET II code of V. Springel 
[11]. Clusters were identified in the simulation by the FOF (friend-of-friend) algorithm. For 
comparison with observations we extracted the 233 most massive clusters in a sphere of radius 
265.3 Mpc (we slightly expanded the simulation box using the periodic boundary conditions). 
We have chosen this simulated sample by keeping its number density equal to the observed 
cluster density (independent of an Lx - mass relation) so that the most massive simulated 
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clusters correspond to the most luminous observed clusters. We use the 3D velocities of 
the simulated clusters and place an observer to the center of the sphere extracted from the 
simulation box. Then the cluster positions were converted to redshift space. The mass of 
the lightest dark matter cluster in the simulated sample is 2.46 • lO^^Mgun- 



4. Statistics of clustering 

4.1. Conditional correlation function 

In order to compare the X-ray and simulated cluster distributions we use different sta- 
tistical methods. The conditional correlation function (CCF) [12,13] measures the number 
density of objects in spheres of radius R averaged over the spheres around all objects of 
the sample that are away from sample boundaries by more then R Mpc (integral CCF). 
The clusters show three distinct regimes of clustering (Fig. 3a): 1) on scales of superclusters 
up to a scale of 35-40 Mpc the CCF is represented by a power law; 2) on larger scales a 
gradual transition to homogeneity is observed; 3) starting from about 100 Mpc the CCF be- 
comes a constant, i.e. the number density does not anymore decline with increasing radius 
of spheres. Fluctuations on scales > 100 Mpc exist but evidently they doesn't contribute to 
cluster number density contrasts on such scales. Here we reach a mean number density of 
clusters (which can not be obtained on smaller scales) in our VL-sample. 
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Fig. 3. — a) Comparison of CCFs for X-clusters and MN-clusters; b) CCFs for VL cluster 
sample in 2 hemispheres: GAC (123° < Igal < 303°) and GC (303° < Igai and Igai < 123°). 
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The CCFs for observed and simulated clusters look quite similar (the second regime of 
clustering from ~ 40 to ~ 100 Mpc is perfectly reproduced by the simulated clusters) however 
the value of the slopes on scales below 40Mpc are different: 7x ~ 1.6 for the observed and 
7m7V ~ 1-25 for the simulated clusters. This difference reflects the lack of close pairs in the 
simulated cluster distribution with respect to the observed ones. The comparison with a 
model cluster distribution obtained from the same realization but using a smaller linking 
length of the FOF algorithm (in order to identify substructure and possible close pairs 
which could be linked by the original linking length) showed that the FOF parameters have 
neghgible influence on the value of the slope. In order to understand the difference in the 
slopes we have gathered observed cluster pairs with separations smaller than 5 Mpc into single 
objects. This reduces the total number of objects in the observed sample by 10. Mostly the 
clusters that belong to the Shapley superclustcr (located at Igai ~ 311°, bgai ~ +30° and 
redshift z ~ 0.05) (Fig. 6) were linked. The CCF of the reduced sample (Fig. 3a) looks nearly 
identical to the simulated one {'-/mn is very close to the 7^ of the reduced observational 
sample). When we calculate CCF separately in two hemispheres we obtain the value of 
slope ■jGAc very close to the model ^mn in the GAC hemisphere that doesn't contain the 
Shapley supercluster (Fig. 3b). This means that the difference is mainly produced by the 
Shapley supercluster - we don't have such a structure in the simulated cluster sample. It is 
an open question how often such oTitstanding structures appear in the Universe and whether 
they can be reproduced by ACDM simulations of larger volumes which should contain larger 
wavelength perturbations that could be responsible for formation of more massive objects. 
Therefore, still, it is a question if we really see in our VL-sample with CCF the universal 
mean X-ray cluster number density. 



4.2. Void statistics 

We have also performed a void analysis of the same observed and simulated samples. 
Starting from the largest empty spheres non-spherical voids have been constructed by ex- 
tending the original spherical void with empty spheres of smaller radii the center of which 
was inside the original void. The radius of the smaller spheres is limited to be larger than 
an ad hoc parameter 0.9 of the radius of initial sphere. The process is repeated a few times. 
It produces voids which are slightly non-spherical. The mean distance between the observed 
(and simulated) clusters is ~ 28 Mpc. Therefore, we have limited our voids to a minimal 
radius of 20 Mpc. The cumulative void volume functions (CVF) AV/Vgampie show that the 
observed and simulated voids flll the sample volumes in a similar way though in Fig.4a we see 
a difference at Rvoid ~ 80 Mpc: the largest simulated voids arc bigger than the observed ones. 
Here the differential void function (DVF) is presented as Rvoid versus rank [14] (largest void 
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has rank 1, Rvoid — (3 • Koici/47r)^/^). It shows rather good agreement between observation 
and simulation. There are two breaks in the DVF (Fig.4b). The one at Rvoid ~ 45 Mpc (less 
prominent) is associated with the scale where the power law regime of clustering vanishes. 
The break at R^oid ~ 100 Mpc can be directly associated with the scale of homogeneity. 
The slope of the Rvoid — Rank relation after the break is 2;^ = 1/(3 — ■jvoids) which gives 
Ivoids ~ 1.2 close to the slope of the CCF on small scales. 
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Fig. 4. — a) CVF for X-ray and MareNostrum-clusters; b) DVF for X-ray and MareNostrum- 
clusters. 



4.3. Cross-correlation of clusters and galaxies 

From SDSS DR6 main galaxy database we selected a region in galactic coordinates 
(48° < Igai < 210°, 50° < bgai < 86°) and built a VL sample with Zmax = 0.1 and Mum = 
— 19.67 (see section 5). There arc 23 X-ray clusters from our compilation in this region. In 
Fig. 5b we present the cluster-galaxy cross-CCF (clusters were used as centers of spheres in 
which the number of galaxies inside was calculated) and compare with the CCF of SDSS 
galaxies in the selected region (star symbols in Fig. 5b). We defined a local number galaxy 
contrast A = 11 / {An R^q/ 3) /pmean — 1, where Rio is the distance to the 10th neighbour 
(it!io ~ 4 Mpc is the mean value for entire sample) and Pmean is the mean galaxy number 
density in the sample. In this regard, the clusters located in the sample have a median galaxy 
constrast of A ~ 40. We have randomly selected 23 galaxies separated by more than 10 Mpc 
and with A < (10 realizations) located in void regions. The filled triangles in Fig5b show 
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the mean cross CCF for these low density regions. We see the same scale of plateau on CCF 
at about 40 Mpc independently of the way of the calculation. The cluster-galaxy cross-CCF 
shows a stronger correlation than observed in the entire galaxy population (CCF for SDSS 
galaxies) and inherits from the cluster-cluster correlations the length of the scaling regime. 
Note that all three curves in Fig. 5b, that were calculated in rather different ways converge 
to the homogeneity regime at the same scale of ~ 40 Mpc. 
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Fig. 5. — a) MST analysis for X-ray and model MN-clusters; b) CrossCCF clusters-galaxies 
(open circles), CCF of SDSS galaxies (stars; error bars are smaller than symbol size), low 
density region galaxies crossCCF (triangles, 10 realizations). 



4.4. Minimal Spanning Tree 

We built the minimal spanning trees (MST) of the cluster samples. The MST consists 
of knots and edges and is constructed by appending new knots satisfying the condition for 
the distance to the already constructed part of the tree being at a minimum [15]. The 
MST and void analyses give us a clue to outline the "skeleton" of structures represented by 
clusters. The full length of the truncated MST when only knots having more then 1 edge 
left, normahzed to the number of such knots is = 37 Mpc for the X-ray cluster sample 
and L^MN = 38 Mpc for the simulated sample (for comparison, random samples with the 
same number density give after averaging = 47 Mpc). Using the MST linking lengths 
LL — 20, LL — 30 and LL — 40 Mpc we have constructed the cumulative functions of 
structure abundances (Fig.Sa) for the observed, simulated and randomly generated samples. 
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Clusters in the observed sample are slightly more structured than the simulated ones (largest 
differences are for LL = 20 Mpc which reflects the same effect we discussed already before 
(section 4.1): a lack of close pairs in the simulated sample). But, the largest structures in 
both samples (for LL = 30 and 40 Mpc) have nearly the same number of clusters: we sec an 
overall agreement of abundances of observed and simulated structures detected in the levels 
of connectivity chosen. Again we see large deviation with respect to the Poisson sample for 
both values of LL which is another signature of clustering in our samples. 




Fig. 6. — X-ray luminous (Lx(0.1 — 2AkeV) > 1.25 • 10^^/i~^ erg/s) member clusters of 
Shapley supercluster (0.039 < z < 0.059) connected by the edges of minimal spanning tree 
shorter than 40/i~^ Mpc. 



5. CCF of galaxy samples from the SDSS DR6 main galaixy database 



When analyzing the SDSS DR6 data, we have selected 3 rectangular regions from the 
region of spectroscopic sky coverage for the convenience of allowance for the boundary con- 



-10- 



ditions for CCF and for ensuring sample completeness. In the (A, 77) coordinate system of 
the survey, the selected regions are SI: 48° < A < 30°, 6° < 77 < 35°; S2: 25° < A < 48°, 
6° < 77 < 35°; S3: -54° < A < -16°, 33° < 77 < -17°. 

We constructed volume limited samples to eliminate the incompleteness in the radial 
coordinate. We have set the limit on the r-band absolute magnitude Mr for the sample of 

galaxies to Mn^ = rii„i~25~5log {R„iax{i+ Zmax)) ~ K (z) , where = 17.77 was taken as the 
limiting r-band magnitude, K(z) is the K-correction, and Rmax is the maximum value of the 
radial coordinate corresponding to z^ax, ( assuming = 0.7, Qm = 0.3). Therefore, we have 
in the VL-sample all galaxies with M,. < Mum- The r magnitudes used here were corrected for 
extinction. To estimate the absolute magnitudes of the galaxies, we used an approximation 
for K-correction for SDSS galaxies of the form K{z) = 2.3537^^ + 0.5735z + 0.18437 [16,17]. 
Here, we present results for two cuts on redshift (zmax)' VLl {z^ax — 0.12, M^j^ = —20.11) 
and VL2 = 0.15, Mum = -20.68) (Fig7a,b). 
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Fig. 7. — CCFs for different VL samples form DR6 SDSS main galaxy database. Ngal - 
number of galaxies in a sample. 



The CCF method deals with spheres that are fully included in the sample. For spheres 
with centers located at large radii they tend to be located close to each other. We have 
limited our analysis to the scale defined by the condition that spheres of large radii do not 
overlap by more than half of their volumes. As can be seen in the Figs, the power law regime 
is hmited to scales ~ 10 — 15 Mpc and the CCFs of different samples show rather concerted 
convergence to the homogeneity regime. We should note a small but distinct differences in 
the amphtudes of the homogeneous regime in the CCFs for three different regions. This 
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means that, at such scales, we can measure the mean density with some scatter that is 
caused by cosmic variance, i.e., the presence of different structures in different samples. The 
characteristic scales of galaxy correlations are significantly smaller than the ones produced 
by clusters in proportions that are similar to those obtained by early application of the more 
traditional two-point correlation function ^{r) (see e.g. [18]). Such differences have a natural 
explanation in the theory of biased galaxy formation. 

6. Estimation of the relative cluster-galaxy bias on 200 Mpc scale 

The theory of structure formation predicts that the clustering of the most massive 
dark matter halos (clusters) is enhanced relative to that of the general mass distribution 
(galaxies) [19, 20]. Fig.l shows large inhomogcncitics in the distribution of clusters on scales 
of 100 — 300 Mpc. In the northern region of the SDSS (Fig.2) we can estimate a relative 
clustering bias (b) for the volume-limited samples of clusters with Lx > 2.5 • 10^^ erg/s and 
galaxies with Mj. < —19.67. In three equal volumes of 5.2 • lO^Mpc^ defined in the redshift 
intervals 0.020-0.069, 0.069-0.087 and 0.087-0.100 for the area covered by the SDSS-DR6 
spectral survey (~ 6100 square degrees) there are 12, 32 and 13 clusters and 36612, 46785 and 
38566 galaxies, respectively. These counts give a rough estimate of cluster-galaxies relative 
bias 6 = 5 ± 2 on the scale of 200 Mpc. This estimate is consistent with the bias (b = 3) 
measured for massive halos (~ 3 • lO^'^Msun) in N-body simulations, but on scales of 15-30 
Mpc [21]. 

7. Scaling properties of galaxies 

For samples containing galaxies with absolute magnitudes less than M* the galaxy 
integral CCF behaves pretty much like a power law with an exponent ^gai ~ 1.0 (sec section 
5) up to scales ~ 10 — 15 Mpc. Galaxies more luminous than M* tend to be more clustered 
[13, 22, 23]. To investigate scaling properties of individual galaxies on such scales, we chose 
all galaxies from the sample (section 5) VLl-Sl {zmax = 0.12) that are located more than 
10 Mpc away from the sample boundaries. For each selected galaxy we calculated number 
N of neighbours in the spheres of radii Rgp (changing Rgp from 2 to 10 Mpc with step 
IMpc) and used linear approximation with slope snr of log{N) — log{Rsp) dependencies. 
We excluded from the analysis very isolated galaxies and galaxies that have an error in the 
slope, as, larger than 0.3 (taking into account the dispersion of slope values and excluding 
"bad" approximations). After completing this procedure, about 37000 galaxies were left. 
Surprisingly the mean slope = 1.7 (which also corresponds to the maximum in the 
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histogram of slope values) does not correspond to an exponent 751 ~ 1.0 of the CCF of the 
sample on scales ~ 10 Mpc. (For instance, in a homogeneous scale-invariant distribution, it 
should be = ^ — jccf)- The dispersion of the distribution of slopes is significant: a^R = 
0.6. By defining galaxy structures according to their density contrast and connectivity (using 
the abovementioned MST technique) wc came to the conclusion that it is hard to associate 
galaxies with slopes in a certain narrow range with identified structures - there is a complex 
mixture of slopes with significant dispersion, but the possibility to trace structures by their 
scaling properties is still an open question. We found some amount of slopes snr > 3: all of 
them are associated with galaxies in relatively low density regions with 5 < 10 (see section 
4.3). Following the approach of [24] we performed a multi-scaling analysis that weights 
high and low density regions in different way according to the positive or negative counts- 
weighting exponent q-1. The generalized dimensions Dg differs significantly for different 
values of q {Dg increases from = 1.1 to D2 = 1.8) which is another signature of the 
complexity of the scaling properties on small scales. Usually this effect is interpreted as a 
manifestation of multifractality. It is evident that galaxy clustering on small scales can not 
be described by simple models like the ones with a unique scaling exponent. 



8. Conclusions 



The application of different complementary statistics to samples of observed and simu- 
lated clusters of galaxies, chosen in a way to fit the observed number density of the volume 
limited X-ray cluster sample, show general agreement in the distribution of most luminous 
virialised objects in the Universe and most massive halos in cosmological simulations. Based 
on the CCF we found the same scale (~ 100 Mpc) of statistical homogeneity (in a sense in 
which we have a definite, but fluctuating, mean number density of objects on such scales) 
for observed and simulated clusters. This scale can be related to the comoving scale of the 
largest wavelength of the acoustic oscillation of the photon-baryon plasma before recombi- 
nation [25]. It is very interesting to note that the second (transitional) clustering regime 
(beyond the characteristic scale of superclusters) shown both by the observed and simulated 
CCF happen to be at the same scales of ~ 40 — 100 Mpc. The Shapley supercluster strongly 
affects the value of the CCF slope of clusters on small scales and it is responsible for the 
differences in the distribution characteristics of observed and simulated clusters: we see a 
lack of close massive cluster pairs in simulations. Larger computational boxes are neces- 
sary to flnd out Shapley-hke structures in simulations. The MST analysis shows that the 
observed clusters are slightly more structured than the simulated ones. The observed and 
simulated void functions agree except for the largest voids. In summary, the distribution of 
most massive ACDM dark matter halos show a reasonable agreement with the distribution 
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of most luminous X-ray clusters of galaxies. The significant differences in the characteristic 
scales of the distribution of X-ray clusters and SDSS galaxies (power law scales at 40 Mpc 
and 10-15 Mpc and homogeneity scales at ~ 100 and 40-50 Mpc, respectively) are similar 
to differences obtained in earlier works using the two-point correlation functions. They can 
be explained by the theory of biased structure formation. Our estimation of the relative 
cluster-galaxy bias value {b ~ 5) is in general agreement with the theoretical prediction. 
The power law behaviour of the decline of the galaxy density with distance indicated by the 
CCF (correlation exponent 7) on small scales has a very complex nature: a dependence of 
7 on colors and luminosities of galaxies, a significant scatter of individual exponents of the 
number-radius relation for different galaxies in the sample, and the evidences of multifrac- 
tality - differences in the scaling properties depending on environment (high and low density 
regions) . 
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